Overview
Brought to you by YData
Dataset statistics
| Number of variables | 22 |
|---|---|
| Number of observations | 280790 |
| Missing cells | 382536 |
| Missing cells (%) | 6.2% |
| Duplicate rows | 0 |
| Duplicate rows (%) | 0.0% |
| Total size in memory | 45.3 MiB |
| Average record size in memory | 169.0 B |
Variable types
| Numeric | 14 |
|---|---|
| Categorical | 6 |
| DateTime | 1 |
| Boolean | 1 |
BAIXO_PESO is highly overall correlated with GESTACAO and 2 other fields | High correlation |
CONSPRENAT is highly overall correlated with CONSULTAS and 1 other fields | High correlation |
CONSULTAS is highly overall correlated with CONSPRENAT and 2 other fields | High correlation |
GESTACAO is highly overall correlated with BAIXO_PESO and 1 other fields | High correlation |
KOTELCHUCK is highly overall correlated with CONSPRENAT and 1 other fields | High correlation |
MESPRENAT is highly overall correlated with CONSULTAS | High correlation |
PARTO is highly overall correlated with STCESPARTO | High correlation |
PESO is highly overall correlated with BAIXO_PESO | High correlation |
QTDFILVIVO is highly overall correlated with QTDPARTNOR | High correlation |
QTDPARTNOR is highly overall correlated with QTDFILVIVO | High correlation |
RACACOR is highly overall correlated with RACACORMAE | High correlation |
RACACORMAE is highly overall correlated with RACACOR | High correlation |
SEMAGESTAC is highly overall correlated with BAIXO_PESO and 1 other fields | High correlation |
STCESPARTO is highly overall correlated with PARTO | High correlation |
GRAVIDEZ is highly imbalanced (92.3%) | Imbalance |
BAIXO_PESO is highly imbalanced (57.0%) | Imbalance |
ESCMAE2010 has 3803 (1.4%) missing values | Missing |
CONSPRENAT has 5240 (1.9%) missing values | Missing |
MESPRENAT has 7911 (2.8%) missing values | Missing |
QTDPARTNOR has 13723 (4.9%) missing values | Missing |
QTDPARTCES has 15290 (5.4%) missing values | Missing |
STCESPARTO has 30253 (10.8%) missing values | Missing |
SEMAGESTAC has 4521 (1.6%) missing values | Missing |
GESTACAO has 4365 (1.6%) missing values | Missing |
SEXO has 251000 (89.4%) missing values | Missing |
RACACORMAE has 10457 (3.7%) missing values | Missing |
RACACOR has 8859 (3.2%) missing values | Missing |
QTDFILVIVO has 9755 (3.5%) missing values | Missing |
QTDFILMORT has 14981 (5.3%) missing values | Missing |
QTDPARTCES is highly skewed (γ1 = 28.34834527) | Skewed |
QTDFILMORT is highly skewed (γ1 = 29.90289762) | Skewed |
QTDPARTNOR has 171113 (60.9%) zeros | Zeros |
QTDPARTCES has 189675 (67.6%) zeros | Zeros |
QTDFILVIVO has 114210 (40.7%) zeros | Zeros |
QTDFILMORT has 213857 (76.2%) zeros | Zeros |
Reproduction
| Analysis started | 2025-09-30 18:20:17.034375 |
|---|---|
| Analysis finished | 2025-09-30 18:21:02.718112 |
| Duration | 45.68 seconds |
| Software version | ydata-profiling vv4.17.0 |
| Download configuration | config.json |
Variables
PESO
Real number (ℝ)
High correlation
| Distinct | 3224 |
|---|---|
| Distinct (%) | 1.1% |
| Missing | 83 |
| Missing (%) | < 0.1% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 3177.8499 |
| Minimum | 100 |
|---|---|
| Maximum | 9999 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 2.1 MiB |
Quantile statistics
| Minimum | 100 |
|---|---|
| 5-th percentile | 2240 |
| Q1 | 2900 |
| median | 3210 |
| Q3 | 3520 |
| 95-th percentile | 3995 |
| Maximum | 9999 |
| Range | 9899 |
| Interquartile range (IQR) | 620 |
Descriptive statistics
| Standard deviation | 563.1771 |
|---|---|
| Coefficient of variation (CV) | 0.17721954 |
| Kurtosis | 3.2091692 |
| Mean | 3177.8499 |
| Median Absolute Deviation (MAD) | 310 |
| Skewness | -0.89115248 |
| Sum | 8.9204471 × 108 |
| Variance | 317168.45 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 3200 | 2478 | 0.9% |
| 3000 | 2449 | 0.9% |
| 3300 | 2387 | 0.9% |
| 3100 | 2309 | 0.8% |
| 3400 | 2203 | 0.8% |
| 3500 | 2079 | 0.7% |
| 3250 | 1853 | 0.7% |
| 3600 | 1831 | 0.7% |
| 3150 | 1806 | 0.6% |
| 3350 | 1788 | 0.6% |
| Other values (3214) | 259524 |
| Value | Count | Frequency (%) |
| 100 | 1 | |
| 105 | 1 | |
| 110 | 1 | |
| 123 | 1 | |
| 130 | 1 | |
| 151 | 1 | |
| 155 | 1 | |
| 159 | 1 | |
| 185 | 1 | |
| 200 | 1 |
| Value | Count | Frequency (%) |
| 9999 | 2 | |
| 6840 | 1 | |
| 6805 | 1 | |
| 6781 | 1 | |
| 6550 | 1 | |
| 6500 | 2 | |
| 6399 | 1 | |
| 6300 | 1 | |
| 6200 | 2 | |
| 6065 | 1 |
IDADEMAE
Real number (ℝ)
| Distinct | 53 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 4 |
| Missing (%) | < 0.1% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 26.910159 |
| Minimum | 11 |
|---|---|
| Maximum | 99 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 2.1 MiB |
Quantile statistics
| Minimum | 11 |
|---|---|
| 5-th percentile | 17 |
| Q1 | 21 |
| median | 27 |
| Q3 | 32 |
| 95-th percentile | 38 |
| Maximum | 99 |
| Range | 88 |
| Interquartile range (IQR) | 11 |
Descriptive statistics
| Standard deviation | 6.7508597 |
|---|---|
| Coefficient of variation (CV) | 0.25086658 |
| Kurtosis | -0.60371717 |
| Mean | 26.910159 |
| Median Absolute Deviation (MAD) | 5 |
| Skewness | 0.24802832 |
| Sum | 7555996 |
| Variance | 45.574107 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 23 | 14101 | 5.0% |
| 22 | 14086 | 5.0% |
| 21 | 13944 | 5.0% |
| 25 | 13784 | 4.9% |
| 26 | 13720 | 4.9% |
| 27 | 13643 | 4.9% |
| 24 | 13593 | 4.8% |
| 28 | 13547 | 4.8% |
| 20 | 13454 | 4.8% |
| 29 | 13027 | 4.6% |
| Other values (43) | 143887 |
| Value | Count | Frequency (%) |
| 11 | 6 | < 0.1% |
| 12 | 48 | < 0.1% |
| 13 | 404 | 0.1% |
| 14 | 1660 | 0.6% |
| 15 | 3734 | 1.3% |
| 16 | 6368 | |
| 17 | 8449 | |
| 18 | 10406 | |
| 19 | 12125 | |
| 20 | 13454 |
| Value | Count | Frequency (%) |
| 99 | 2 | |
| 63 | 1 | |
| 62 | 2 | |
| 61 | 1 | |
| 60 | 1 | |
| 59 | 1 | |
| 57 | 2 | |
| 56 | 1 | |
| 55 | 2 | |
| 54 | 2 |
ESCMAE2010
Real number (ℝ)
Missing
| Distinct | 7 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 3803 |
| Missing (%) | 1.4% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 3.0783575 |
| Minimum | 0 |
|---|---|
| Maximum | 9 |
| Zeros | 1233 |
| Zeros (%) | 0.4% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 2.1 MiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 2 |
| Q1 | 2 |
| median | 3 |
| Q3 | 3 |
| 95-th percentile | 5 |
| Maximum | 9 |
| Range | 9 |
| Interquartile range (IQR) | 1 |
Descriptive statistics
| Standard deviation | 1.1370102 |
|---|---|
| Coefficient of variation (CV) | 0.36935612 |
| Kurtosis | 2.9441273 |
| Mean | 3.0783575 |
| Median Absolute Deviation (MAD) | 0 |
| Skewness | 0.99134221 |
| Sum | 852665 |
| Variance | 1.2927922 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 3 | 141849 | |
| 2 | 63383 | |
| 5 | 44122 | 15.7% |
| 4 | 14010 | 5.0% |
| 1 | 10976 | 3.9% |
| 9 | 1414 | 0.5% |
| 0 | 1233 | 0.4% |
| (Missing) | 3803 | 1.4% |
| Value | Count | Frequency (%) |
| 0 | 1233 | 0.4% |
| 1 | 10976 | 3.9% |
| 2 | 63383 | |
| 3 | 141849 | |
| 4 | 14010 | 5.0% |
| 5 | 44122 | 15.7% |
| 9 | 1414 | 0.5% |
| Value | Count | Frequency (%) |
| 9 | 1414 | 0.5% |
| 5 | 44122 | 15.7% |
| 4 | 14010 | 5.0% |
| 3 | 141849 | |
| 2 | 63383 | |
| 1 | 10976 | 3.9% |
| 0 | 1233 | 0.4% |
CONSULTAS
Categorical
High correlation
| Distinct | 5 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 113 |
| Missing (%) | < 0.1% |
| Memory size | 2.1 MiB |
| 4.0 | |
|---|---|
| 3.0 | |
| 2.0 | 15941 |
| 1.0 | 5194 |
| 9.0 | 1469 |
Length
| Max length | 3 |
|---|---|
| Median length | 3 |
| Mean length | 3 |
| Min length | 3 |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | 3.0 |
|---|---|
| 2nd row | 3.0 |
| 3rd row | 3.0 |
| 4th row | 4.0 |
| 5th row | 3.0 |
Common Values
| Value | Count | Frequency (%) |
| 4.0 | 198120 | |
| 3.0 | 59953 | 21.4% |
| 2.0 | 15941 | 5.7% |
| 1.0 | 5194 | 1.8% |
| 9.0 | 1469 | 0.5% |
| (Missing) | 113 | < 0.1% |
Length
Common Values (Plot)
| Value | Count | Frequency (%) |
| 4.0 | 198120 | |
| 3.0 | 59953 | 21.4% |
| 2.0 | 15941 | 5.7% |
| 1.0 | 5194 | 1.9% |
| 9.0 | 1469 | 0.5% |
Most occurring characters
| Value | Count | Frequency (%) |
| . | 280677 | |
| 0 | 280677 | |
| 4 | 198120 | |
| 3 | 59953 | 7.1% |
| 2 | 15941 | 1.9% |
| 1 | 5194 | 0.6% |
| 9 | 1469 | 0.2% |
Most occurring categories
| Value | Count | Frequency (%) |
| (unknown) | 842031 |
Most frequent character per category
(unknown)
| Value | Count | Frequency (%) |
| . | 280677 | |
| 0 | 280677 | |
| 4 | 198120 | |
| 3 | 59953 | 7.1% |
| 2 | 15941 | 1.9% |
| 1 | 5194 | 0.6% |
| 9 | 1469 | 0.2% |
Most occurring scripts
| Value | Count | Frequency (%) |
| (unknown) | 842031 |
Most frequent character per script
(unknown)
| Value | Count | Frequency (%) |
| . | 280677 | |
| 0 | 280677 | |
| 4 | 198120 | |
| 3 | 59953 | 7.1% |
| 2 | 15941 | 1.9% |
| 1 | 5194 | 0.6% |
| 9 | 1469 | 0.2% |
Most occurring blocks
| Value | Count | Frequency (%) |
| (unknown) | 842031 |
Most frequent character per block
(unknown)
| Value | Count | Frequency (%) |
| . | 280677 | |
| 0 | 280677 | |
| 4 | 198120 | |
| 3 | 59953 | 7.1% |
| 2 | 15941 | 1.9% |
| 1 | 5194 | 0.6% |
| 9 | 1469 | 0.2% |
CONSPRENAT
Real number (ℝ)
High correlation Missing
| Distinct | 48 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 5240 |
| Missing (%) | 1.9% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 8.5709998 |
| Minimum | 0 |
|---|---|
| Maximum | 99 |
| Zeros | 1642 |
| Zeros (%) | 0.6% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 2.1 MiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 3 |
| Q1 | 6 |
| median | 8 |
| Q3 | 10 |
| 95-th percentile | 13 |
| Maximum | 99 |
| Range | 99 |
| Interquartile range (IQR) | 4 |
Descriptive statistics
| Standard deviation | 7.2351981 |
|---|---|
| Coefficient of variation (CV) | 0.84414868 |
| Kurtosis | 124.90158 |
| Mean | 8.5709998 |
| Median Absolute Deviation (MAD) | 2 |
| Skewness | 10.231781 |
| Sum | 2361739 |
| Variance | 52.348092 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 8 | 40167 | |
| 10 | 39218 | |
| 9 | 36226 | |
| 7 | 35036 | |
| 6 | 28108 | |
| 5 | 18804 | |
| 11 | 15914 | 5.7% |
| 12 | 14677 | 5.2% |
| 4 | 12672 | 4.5% |
| 3 | 8082 | 2.9% |
| Other values (38) | 26646 |
| Value | Count | Frequency (%) |
| 0 | 1642 | 0.6% |
| 1 | 2780 | 1.0% |
| 2 | 4968 | 1.8% |
| 3 | 8082 | 2.9% |
| 4 | 12672 | 4.5% |
| 5 | 18804 | |
| 6 | 28108 | |
| 7 | 35036 | |
| 8 | 40167 | |
| 9 | 36226 |
| Value | Count | Frequency (%) |
| 99 | 1440 | |
| 77 | 1 | < 0.1% |
| 75 | 1 | < 0.1% |
| 69 | 1 | < 0.1% |
| 64 | 1 | < 0.1% |
| 63 | 1 | < 0.1% |
| 42 | 2 | < 0.1% |
| 41 | 10 | < 0.1% |
| 40 | 33 | < 0.1% |
| 39 | 43 | < 0.1% |
MESPRENAT
Real number (ℝ)
High correlation Missing
| Distinct | 11 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 7911 |
| Missing (%) | 2.8% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 5.1011584 |
| Minimum | 1 |
|---|---|
| Maximum | 99 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 2.1 MiB |
Quantile statistics
| Minimum | 1 |
|---|---|
| 5-th percentile | 1 |
| Q1 | 2 |
| median | 2 |
| Q3 | 3 |
| 95-th percentile | 6 |
| Maximum | 99 |
| Range | 98 |
| Interquartile range (IQR) | 1 |
Descriptive statistics
| Standard deviation | 15.65202 |
|---|---|
| Coefficient of variation (CV) | 3.0683266 |
| Kurtosis | 31.738727 |
| Mean | 5.1011584 |
| Median Absolute Deviation (MAD) | 1 |
| Skewness | 5.7829172 |
| Sum | 1391999 |
| Variance | 244.98574 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 2 | 103790 | |
| 1 | 60543 | |
| 3 | 53353 | |
| 4 | 22827 | 8.1% |
| 5 | 12456 | 4.4% |
| 99 | 7318 | 2.6% |
| 6 | 6295 | 2.2% |
| 7 | 3452 | 1.2% |
| 8 | 1815 | 0.6% |
| 9 | 1007 | 0.4% |
| (Missing) | 7911 | 2.8% |
| Value | Count | Frequency (%) |
| 1 | 60543 | |
| 2 | 103790 | |
| 3 | 53353 | |
| 4 | 22827 | 8.1% |
| 5 | 12456 | 4.4% |
| 6 | 6295 | 2.2% |
| 7 | 3452 | 1.2% |
| 8 | 1815 | 0.6% |
| 9 | 1007 | 0.4% |
| 10 | 23 | < 0.1% |
| Value | Count | Frequency (%) |
| 99 | 7318 | 2.6% |
| 10 | 23 | < 0.1% |
| 9 | 1007 | 0.4% |
| 8 | 1815 | 0.6% |
| 7 | 3452 | 1.2% |
| 6 | 6295 | 2.2% |
| 5 | 12456 | 4.4% |
| 4 | 22827 | 8.1% |
| 3 | 53353 | |
| 2 | 103790 |
KOTELCHUCK
Real number (ℝ)
High correlation
| Distinct | 6 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 4.4341465 |
| Minimum | 1 |
|---|---|
| Maximum | 9 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 2.1 MiB |
Quantile statistics
| Minimum | 1 |
|---|---|
| 5-th percentile | 2 |
| Q1 | 3 |
| median | 5 |
| Q3 | 5 |
| 95-th percentile | 9 |
| Maximum | 9 |
| Range | 8 |
| Interquartile range (IQR) | 2 |
Descriptive statistics
| Standard deviation | 1.5964881 |
|---|---|
| Coefficient of variation (CV) | 0.36004406 |
| Kurtosis | 1.5638664 |
| Mean | 4.4341465 |
| Median Absolute Deviation (MAD) | 0 |
| Skewness | 0.47956809 |
| Sum | 1245064 |
| Variance | 2.5487744 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 5 | 175001 | |
| 2 | 50146 | 17.9% |
| 4 | 20052 | 7.1% |
| 3 | 19604 | 7.0% |
| 9 | 14345 | 5.1% |
| 1 | 1642 | 0.6% |
| Value | Count | Frequency (%) |
| 1 | 1642 | 0.6% |
| 2 | 50146 | 17.9% |
| 3 | 19604 | 7.0% |
| 4 | 20052 | 7.1% |
| 5 | 175001 | |
| 9 | 14345 | 5.1% |
| Value | Count | Frequency (%) |
| 9 | 14345 | 5.1% |
| 5 | 175001 | |
| 4 | 20052 | 7.1% |
| 3 | 19604 | 7.0% |
| 2 | 50146 | 17.9% |
| 1 | 1642 | 0.6% |
GRAVIDEZ
Categorical
Imbalance
| Distinct | 4 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 320 |
| Missing (%) | 0.1% |
| Memory size | 2.1 MiB |
| 1.0 | |
|---|---|
| 2.0 | 5967 |
| 3.0 | 126 |
| 9.0 | 11 |
Length
| Max length | 3 |
|---|---|
| Median length | 3 |
| Mean length | 3 |
| Min length | 3 |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | 1.0 |
|---|---|
| 2nd row | 1.0 |
| 3rd row | 1.0 |
| 4th row | 1.0 |
| 5th row | 1.0 |
Common Values
| Value | Count | Frequency (%) |
| 1.0 | 274366 | |
| 2.0 | 5967 | 2.1% |
| 3.0 | 126 | < 0.1% |
| 9.0 | 11 | < 0.1% |
| (Missing) | 320 | 0.1% |
Length
Common Values (Plot)
| Value | Count | Frequency (%) |
| 1.0 | 274366 | |
| 2.0 | 5967 | 2.1% |
| 3.0 | 126 | < 0.1% |
| 9.0 | 11 | < 0.1% |
Most occurring characters
| Value | Count | Frequency (%) |
| . | 280470 | |
| 0 | 280470 | |
| 1 | 274366 | |
| 2 | 5967 | 0.7% |
| 3 | 126 | < 0.1% |
| 9 | 11 | < 0.1% |
Most occurring categories
| Value | Count | Frequency (%) |
| (unknown) | 841410 |
Most frequent character per category
(unknown)
| Value | Count | Frequency (%) |
| . | 280470 | |
| 0 | 280470 | |
| 1 | 274366 | |
| 2 | 5967 | 0.7% |
| 3 | 126 | < 0.1% |
| 9 | 11 | < 0.1% |
Most occurring scripts
| Value | Count | Frequency (%) |
| (unknown) | 841410 |
Most frequent character per script
(unknown)
| Value | Count | Frequency (%) |
| . | 280470 | |
| 0 | 280470 | |
| 1 | 274366 | |
| 2 | 5967 | 0.7% |
| 3 | 126 | < 0.1% |
| 9 | 11 | < 0.1% |
Most occurring blocks
| Value | Count | Frequency (%) |
| (unknown) | 841410 |
Most frequent character per block
(unknown)
| Value | Count | Frequency (%) |
| . | 280470 | |
| 0 | 280470 | |
| 1 | 274366 | |
| 2 | 5967 | 0.7% |
| 3 | 126 | < 0.1% |
| 9 | 11 | < 0.1% |
QTDPARTNOR
Real number (ℝ)
High correlation Missing Zeros
| Distinct | 26 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 13723 |
| Missing (%) | 4.9% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 0.69567562 |
| Minimum | 0 |
|---|---|
| Maximum | 99 |
| Zeros | 171113 |
| Zeros (%) | 60.9% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 2.1 MiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 0 |
| median | 0 |
| Q3 | 1 |
| 95-th percentile | 3 |
| Maximum | 99 |
| Range | 99 |
| Interquartile range (IQR) | 1 |
Descriptive statistics
| Standard deviation | 1.3122063 |
|---|---|
| Coefficient of variation (CV) | 1.886233 |
| Kurtosis | 386.12389 |
| Mean | 0.69567562 |
| Median Absolute Deviation (MAD) | 0 |
| Skewness | 7.888942 |
| Sum | 185792 |
| Variance | 1.7218854 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 0 | 171113 | |
| 1 | 50693 | 18.1% |
| 2 | 23801 | 8.5% |
| 3 | 10821 | 3.9% |
| 4 | 5114 | 1.8% |
| 5 | 2589 | 0.9% |
| 6 | 1326 | 0.5% |
| 7 | 730 | 0.3% |
| 8 | 386 | 0.1% |
| 9 | 226 | 0.1% |
| Other values (16) | 268 | 0.1% |
| (Missing) | 13723 | 4.9% |
| Value | Count | Frequency (%) |
| 0 | 171113 | |
| 1 | 50693 | 18.1% |
| 2 | 23801 | 8.5% |
| 3 | 10821 | 3.9% |
| 4 | 5114 | 1.8% |
| 5 | 2589 | 0.9% |
| 6 | 1326 | 0.5% |
| 7 | 730 | 0.3% |
| 8 | 386 | 0.1% |
| 9 | 226 | 0.1% |
| Value | Count | Frequency (%) |
| 99 | 3 | |
| 58 | 1 | < 0.1% |
| 39 | 2 | < 0.1% |
| 30 | 2 | < 0.1% |
| 23 | 2 | < 0.1% |
| 22 | 2 | < 0.1% |
| 21 | 3 | |
| 20 | 5 | |
| 17 | 1 | < 0.1% |
| 16 | 1 | < 0.1% |
QTDPARTCES
Real number (ℝ)
Missing Skewed Zeros
| Distinct | 21 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 15290 |
| Missing (%) | 5.4% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 0.37511111 |
| Minimum | 0 |
|---|---|
| Maximum | 99 |
| Zeros | 189675 |
| Zeros (%) | 67.6% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 2.1 MiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 0 |
| median | 0 |
| Q3 | 1 |
| 95-th percentile | 2 |
| Maximum | 99 |
| Range | 99 |
| Interquartile range (IQR) | 1 |
Descriptive statistics
| Standard deviation | 0.78792334 |
|---|---|
| Coefficient of variation (CV) | 2.1005065 |
| Kurtosis | 3085.9937 |
| Mean | 0.37511111 |
| Median Absolute Deviation (MAD) | 0 |
| Skewness | 28.348345 |
| Sum | 99592 |
| Variance | 0.62082318 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 0 | 189675 | |
| 1 | 57204 | 20.4% |
| 2 | 15097 | 5.4% |
| 3 | 2904 | 1.0% |
| 4 | 476 | 0.2% |
| 5 | 67 | < 0.1% |
| 6 | 28 | < 0.1% |
| 10 | 15 | < 0.1% |
| 20 | 9 | < 0.1% |
| 7 | 4 | < 0.1% |
| Other values (11) | 21 | < 0.1% |
| (Missing) | 15290 | 5.4% |
| Value | Count | Frequency (%) |
| 0 | 189675 | |
| 1 | 57204 | 20.4% |
| 2 | 15097 | 5.4% |
| 3 | 2904 | 1.0% |
| 4 | 476 | 0.2% |
| 5 | 67 | < 0.1% |
| 6 | 28 | < 0.1% |
| 7 | 4 | < 0.1% |
| 8 | 1 | < 0.1% |
| 9 | 1 | < 0.1% |
| Value | Count | Frequency (%) |
| 99 | 3 | < 0.1% |
| 70 | 1 | < 0.1% |
| 41 | 1 | < 0.1% |
| 32 | 2 | < 0.1% |
| 25 | 3 | < 0.1% |
| 23 | 1 | < 0.1% |
| 22 | 3 | < 0.1% |
| 20 | 9 | |
| 14 | 3 | < 0.1% |
| 11 | 2 | < 0.1% |
PARTO
Categorical
High correlation
| Distinct | 3 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 213 |
| Missing (%) | 0.1% |
| Memory size | 2.1 MiB |
| 2.0 | |
|---|---|
| 1.0 | |
| 9.0 | 11 |
Length
| Max length | 3 |
|---|---|
| Median length | 3 |
| Mean length | 3 |
| Min length | 3 |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | 1.0 |
|---|---|
| 2nd row | 2.0 |
| 3rd row | 1.0 |
| 4th row | 1.0 |
| 5th row | 1.0 |
Common Values
| Value | Count | Frequency (%) |
| 2.0 | 159557 | |
| 1.0 | 121009 | |
| 9.0 | 11 | < 0.1% |
| (Missing) | 213 | 0.1% |
Length
Common Values (Plot)
| Value | Count | Frequency (%) |
| 2.0 | 159557 | |
| 1.0 | 121009 | |
| 9.0 | 11 | < 0.1% |
Most occurring characters
| Value | Count | Frequency (%) |
| . | 280577 | |
| 0 | 280577 | |
| 2 | 159557 | |
| 1 | 121009 | |
| 9 | 11 | < 0.1% |
Most occurring categories
| Value | Count | Frequency (%) |
| (unknown) | 841731 |
Most frequent character per category
(unknown)
| Value | Count | Frequency (%) |
| . | 280577 | |
| 0 | 280577 | |
| 2 | 159557 | |
| 1 | 121009 | |
| 9 | 11 | < 0.1% |
Most occurring scripts
| Value | Count | Frequency (%) |
| (unknown) | 841731 |
Most frequent character per script
(unknown)
| Value | Count | Frequency (%) |
| . | 280577 | |
| 0 | 280577 | |
| 2 | 159557 | |
| 1 | 121009 | |
| 9 | 11 | < 0.1% |
Most occurring blocks
| Value | Count | Frequency (%) |
| (unknown) | 841731 |
Most frequent character per block
(unknown)
| Value | Count | Frequency (%) |
| . | 280577 | |
| 0 | 280577 | |
| 2 | 159557 | |
| 1 | 121009 | |
| 9 | 11 | < 0.1% |
STCESPARTO
Categorical
High correlation Missing
| Distinct | 4 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 30253 |
| Missing (%) | 10.8% |
| Memory size | 2.1 MiB |
| 3.0 | |
|---|---|
| 1.0 | |
| 2.0 | |
| 9.0 | 7245 |
Length
| Max length | 3 |
|---|---|
| Median length | 3 |
| Mean length | 3 |
| Min length | 3 |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | 2.0 |
|---|---|
| 2nd row | 2.0 |
| 3rd row | 9.0 |
| 4th row | 2.0 |
| 5th row | 1.0 |
Common Values
| Value | Count | Frequency (%) |
| 3.0 | 95845 | |
| 1.0 | 79117 | |
| 2.0 | 68330 | |
| 9.0 | 7245 | 2.6% |
| (Missing) | 30253 | 10.8% |
Length
Common Values (Plot)
| Value | Count | Frequency (%) |
| 3.0 | 95845 | |
| 1.0 | 79117 | |
| 2.0 | 68330 | |
| 9.0 | 7245 | 2.9% |
Most occurring characters
| Value | Count | Frequency (%) |
| . | 250537 | |
| 0 | 250537 | |
| 3 | 95845 | 12.8% |
| 1 | 79117 | 10.5% |
| 2 | 68330 | 9.1% |
| 9 | 7245 | 1.0% |
Most occurring categories
| Value | Count | Frequency (%) |
| (unknown) | 751611 |
Most frequent character per category
(unknown)
| Value | Count | Frequency (%) |
| . | 250537 | |
| 0 | 250537 | |
| 3 | 95845 | 12.8% |
| 1 | 79117 | 10.5% |
| 2 | 68330 | 9.1% |
| 9 | 7245 | 1.0% |
Most occurring scripts
| Value | Count | Frequency (%) |
| (unknown) | 751611 |
Most frequent character per script
(unknown)
| Value | Count | Frequency (%) |
| . | 250537 | |
| 0 | 250537 | |
| 3 | 95845 | 12.8% |
| 1 | 79117 | 10.5% |
| 2 | 68330 | 9.1% |
| 9 | 7245 | 1.0% |
Most occurring blocks
| Value | Count | Frequency (%) |
| (unknown) | 751611 |
Most frequent character per block
(unknown)
| Value | Count | Frequency (%) |
| . | 250537 | |
| 0 | 250537 | |
| 3 | 95845 | 12.8% |
| 1 | 79117 | 10.5% |
| 2 | 68330 | 9.1% |
| 9 | 7245 | 1.0% |
SEMAGESTAC
Real number (ℝ)
High correlation Missing
| Distinct | 27 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 4521 |
| Missing (%) | 1.6% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 38.45005 |
| Minimum | 19 |
|---|---|
| Maximum | 45 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 2.1 MiB |
Quantile statistics
| Minimum | 19 |
|---|---|
| 5-th percentile | 35 |
| Q1 | 38 |
| median | 39 |
| Q3 | 40 |
| 95-th percentile | 41 |
| Maximum | 45 |
| Range | 26 |
| Interquartile range (IQR) | 2 |
Descriptive statistics
| Standard deviation | 2.2330566 |
|---|---|
| Coefficient of variation (CV) | 0.058076819 |
| Kurtosis | 10.887931 |
| Mean | 38.45005 |
| Median Absolute Deviation (MAD) | 1 |
| Skewness | -2.2888255 |
| Sum | 10622557 |
| Variance | 4.986542 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 39 | 80071 | |
| 38 | 58129 | |
| 40 | 52626 | |
| 37 | 26475 | 9.4% |
| 41 | 20394 | 7.3% |
| 36 | 12017 | 4.3% |
| 35 | 6582 | 2.3% |
| 42 | 4703 | 1.7% |
| 34 | 4131 | 1.5% |
| 33 | 2548 | 0.9% |
| Other values (17) | 8593 | 3.1% |
| (Missing) | 4521 | 1.6% |
| Value | Count | Frequency (%) |
| 19 | 29 | < 0.1% |
| 20 | 45 | < 0.1% |
| 21 | 79 | < 0.1% |
| 22 | 122 | < 0.1% |
| 23 | 159 | 0.1% |
| 24 | 219 | |
| 25 | 234 | |
| 26 | 336 | |
| 27 | 343 | |
| 28 | 470 |
| Value | Count | Frequency (%) |
| 45 | 345 | 0.1% |
| 44 | 662 | 0.2% |
| 43 | 1403 | 0.5% |
| 42 | 4703 | 1.7% |
| 41 | 20394 | 7.3% |
| 40 | 52626 | |
| 39 | 80071 | |
| 38 | 58129 | |
| 37 | 26475 | 9.4% |
| 36 | 12017 | 4.3% |
GESTACAO
Real number (ℝ)
High correlation Missing
| Distinct | 7 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 4365 |
| Missing (%) | 1.6% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 4.8904694 |
| Minimum | 1 |
|---|---|
| Maximum | 9 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 2.1 MiB |
Quantile statistics
| Minimum | 1 |
|---|---|
| 5-th percentile | 4 |
| Q1 | 5 |
| median | 5 |
| Q3 | 5 |
| 95-th percentile | 5 |
| Maximum | 9 |
| Range | 8 |
| Interquartile range (IQR) | 0 |
Descriptive statistics
| Standard deviation | 0.45965157 |
|---|---|
| Coefficient of variation (CV) | 0.093989253 |
| Kurtosis | 14.412294 |
| Mean | 4.8904694 |
| Median Absolute Deviation (MAD) | 0 |
| Skewness | -2.5174079 |
| Sum | 1351848 |
| Variance | 0.21127956 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 5 | 237782 | |
| 4 | 27040 | 9.6% |
| 6 | 7114 | 2.5% |
| 3 | 2864 | 1.0% |
| 2 | 1413 | 0.5% |
| 1 | 154 | 0.1% |
| 9 | 58 | < 0.1% |
| (Missing) | 4365 | 1.6% |
| Value | Count | Frequency (%) |
| 1 | 154 | 0.1% |
| 2 | 1413 | 0.5% |
| 3 | 2864 | 1.0% |
| 4 | 27040 | 9.6% |
| 5 | 237782 | |
| 6 | 7114 | 2.5% |
| 9 | 58 | < 0.1% |
| Value | Count | Frequency (%) |
| 9 | 58 | < 0.1% |
| 6 | 7114 | 2.5% |
| 5 | 237782 | |
| 4 | 27040 | 9.6% |
| 3 | 2864 | 1.0% |
| 2 | 1413 | 0.5% |
| 1 | 154 | 0.1% |
SEXO
Categorical
Missing
| Distinct | 2 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 251000 |
| Missing (%) | 89.4% |
| Memory size | 2.1 MiB |
| 1.0 | |
|---|---|
| 0.0 |
Length
| Max length | 3 |
|---|---|
| Median length | 3 |
| Mean length | 3 |
| Min length | 3 |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | 0.0 |
|---|---|
| 2nd row | 0.0 |
| 3rd row | 1.0 |
| 4th row | 0.0 |
| 5th row | 0.0 |
Common Values
| Value | Count | Frequency (%) |
| 1.0 | 15330 | 5.5% |
| 0.0 | 14460 | 5.1% |
| (Missing) | 251000 |
Length
Common Values (Plot)
| Value | Count | Frequency (%) |
| 1.0 | 15330 | |
| 0.0 | 14460 |
Most occurring characters
| Value | Count | Frequency (%) |
| 0 | 44250 | |
| . | 29790 | |
| 1 | 15330 | 17.2% |
Most occurring categories
| Value | Count | Frequency (%) |
| (unknown) | 89370 |
Most frequent character per category
(unknown)
| Value | Count | Frequency (%) |
| 0 | 44250 | |
| . | 29790 | |
| 1 | 15330 | 17.2% |
Most occurring scripts
| Value | Count | Frequency (%) |
| (unknown) | 89370 |
Most frequent character per script
(unknown)
| Value | Count | Frequency (%) |
| 0 | 44250 | |
| . | 29790 | |
| 1 | 15330 | 17.2% |
Most occurring blocks
| Value | Count | Frequency (%) |
| (unknown) | 89370 |
Most frequent character per block
(unknown)
| Value | Count | Frequency (%) |
| 0 | 44250 | |
| . | 29790 | |
| 1 | 15330 | 17.2% |
RACACORMAE
Real number (ℝ)
High correlation Missing
| Distinct | 6 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 10457 |
| Missing (%) | 3.7% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 2.8234548 |
| Minimum | 1 |
|---|---|
| Maximum | 9 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 2.1 MiB |
Quantile statistics
| Minimum | 1 |
|---|---|
| 5-th percentile | 1 |
| Q1 | 1 |
| median | 4 |
| Q3 | 4 |
| 95-th percentile | 4 |
| Maximum | 9 |
| Range | 8 |
| Interquartile range (IQR) | 3 |
Descriptive statistics
| Standard deviation | 1.4406854 |
|---|---|
| Coefficient of variation (CV) | 0.51025624 |
| Kurtosis | -1.5506504 |
| Mean | 2.8234548 |
| Median Absolute Deviation (MAD) | 0 |
| Skewness | -0.33021285 |
| Sum | 763273 |
| Variance | 2.0755745 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 4 | 153974 | |
| 1 | 95412 | |
| 2 | 17046 | 6.1% |
| 5 | 2521 | 0.9% |
| 3 | 1192 | 0.4% |
| 9 | 188 | 0.1% |
| (Missing) | 10457 | 3.7% |
| Value | Count | Frequency (%) |
| 1 | 95412 | |
| 2 | 17046 | 6.1% |
| 3 | 1192 | 0.4% |
| 4 | 153974 | |
| 5 | 2521 | 0.9% |
| 9 | 188 | 0.1% |
| Value | Count | Frequency (%) |
| 9 | 188 | 0.1% |
| 5 | 2521 | 0.9% |
| 4 | 153974 | |
| 3 | 1192 | 0.4% |
| 2 | 17046 | 6.1% |
| 1 | 95412 |
Length
| Max length | 1 |
|---|---|
| Median length | 1 |
| Mean length | 1 |
| Min length | 1 |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | 0 |
|---|---|
| 2nd row | 1 |
| 3rd row | 1 |
| 4th row | 1 |
| 5th row | 0 |
Common Values
| Value | Count | Frequency (%) |
| 1 | 172152 | |
| 0 | 108638 |
Length
Common Values (Plot)
| Value | Count | Frequency (%) |
| 1 | 172152 | |
| 0 | 108638 |
Most occurring characters
| Value | Count | Frequency (%) |
| 1 | 172152 | |
| 0 | 108638 |
Most occurring categories
| Value | Count | Frequency (%) |
| (unknown) | 280790 |
Most frequent character per category
(unknown)
| Value | Count | Frequency (%) |
| 1 | 172152 | |
| 0 | 108638 |
Most occurring scripts
| Value | Count | Frequency (%) |
| (unknown) | 280790 |
Most frequent character per script
(unknown)
| Value | Count | Frequency (%) |
| 1 | 172152 | |
| 0 | 108638 |
Most occurring blocks
| Value | Count | Frequency (%) |
| (unknown) | 280790 |
Most frequent character per block
(unknown)
| Value | Count | Frequency (%) |
| 1 | 172152 | |
| 0 | 108638 |
RACACOR
Real number (ℝ)
High correlation Missing
| Distinct | 6 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 8859 |
| Missing (%) | 3.2% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 2.8269083 |
| Minimum | 1 |
|---|---|
| Maximum | 9 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 2.1 MiB |
Quantile statistics
| Minimum | 1 |
|---|---|
| 5-th percentile | 1 |
| Q1 | 1 |
| median | 4 |
| Q3 | 4 |
| 95-th percentile | 4 |
| Maximum | 9 |
| Range | 8 |
| Interquartile range (IQR) | 3 |
Descriptive statistics
| Standard deviation | 1.4401222 |
|---|---|
| Coefficient of variation (CV) | 0.50943364 |
| Kurtosis | -1.5482295 |
| Mean | 2.8269083 |
| Median Absolute Deviation (MAD) | 0 |
| Skewness | -0.33526501 |
| Sum | 768724 |
| Variance | 2.0739519 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 4 | 155202 | |
| 1 | 95715 | |
| 2 | 17075 | 6.1% |
| 5 | 2553 | 0.9% |
| 3 | 1198 | 0.4% |
| 9 | 188 | 0.1% |
| (Missing) | 8859 | 3.2% |
| Value | Count | Frequency (%) |
| 1 | 95715 | |
| 2 | 17075 | 6.1% |
| 3 | 1198 | 0.4% |
| 4 | 155202 | |
| 5 | 2553 | 0.9% |
| 9 | 188 | 0.1% |
| Value | Count | Frequency (%) |
| 9 | 188 | 0.1% |
| 5 | 2553 | 0.9% |
| 4 | 155202 | |
| 3 | 1198 | 0.4% |
| 2 | 17075 | 6.1% |
| 1 | 95715 |
QTDFILVIVO
Real number (ℝ)
High correlation Missing Zeros
| Distinct | 21 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 9755 |
| Missing (%) | 3.5% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 1.0269264 |
| Minimum | 0 |
|---|---|
| Maximum | 99 |
| Zeros | 114210 |
| Zeros (%) | 40.7% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 2.1 MiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 0 |
| median | 1 |
| Q3 | 2 |
| 95-th percentile | 3 |
| Maximum | 99 |
| Range | 99 |
| Interquartile range (IQR) | 2 |
Descriptive statistics
| Standard deviation | 1.3201982 |
|---|---|
| Coefficient of variation (CV) | 1.2855821 |
| Kurtosis | 343.14506 |
| Mean | 1.0269264 |
| Median Absolute Deviation (MAD) | 1 |
| Skewness | 6.5097291 |
| Sum | 278333 |
| Variance | 1.7429234 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 0 | 114210 | |
| 1 | 88625 | |
| 2 | 40271 | 14.3% |
| 3 | 15424 | 5.5% |
| 4 | 6368 | 2.3% |
| 5 | 3031 | 1.1% |
| 6 | 1468 | 0.5% |
| 7 | 784 | 0.3% |
| 8 | 419 | 0.1% |
| 9 | 214 | 0.1% |
| Other values (11) | 221 | 0.1% |
| (Missing) | 9755 | 3.5% |
| Value | Count | Frequency (%) |
| 0 | 114210 | |
| 1 | 88625 | |
| 2 | 40271 | 14.3% |
| 3 | 15424 | 5.5% |
| 4 | 6368 | 2.3% |
| 5 | 3031 | 1.1% |
| 6 | 1468 | 0.5% |
| 7 | 784 | 0.3% |
| 8 | 419 | 0.1% |
| 9 | 214 | 0.1% |
| Value | Count | Frequency (%) |
| 99 | 3 | < 0.1% |
| 30 | 1 | < 0.1% |
| 20 | 2 | < 0.1% |
| 17 | 1 | < 0.1% |
| 16 | 1 | < 0.1% |
| 15 | 4 | < 0.1% |
| 14 | 7 | < 0.1% |
| 13 | 6 | < 0.1% |
| 12 | 23 | |
| 11 | 51 |
QTDFILMORT
Real number (ℝ)
Missing Skewed Zeros
| Distinct | 17 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 14981 |
| Missing (%) | 5.3% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 0.24927297 |
| Minimum | 0 |
|---|---|
| Maximum | 99 |
| Zeros | 213857 |
| Zeros (%) | 76.2% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 2.1 MiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 0 |
| median | 0 |
| Q3 | 0 |
| 95-th percentile | 1 |
| Maximum | 99 |
| Range | 99 |
| Interquartile range (IQR) | 0 |
Descriptive statistics
| Standard deviation | 0.64313564 |
|---|---|
| Coefficient of variation (CV) | 2.5800456 |
| Kurtosis | 4198.619 |
| Mean | 0.24927297 |
| Median Absolute Deviation (MAD) | 0 |
| Skewness | 29.902898 |
| Sum | 66259 |
| Variance | 0.41362345 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 0 | 213857 | |
| 1 | 41574 | 14.8% |
| 2 | 7900 | 2.8% |
| 3 | 1784 | 0.6% |
| 4 | 423 | 0.2% |
| 5 | 135 | < 0.1% |
| 6 | 69 | < 0.1% |
| 7 | 32 | < 0.1% |
| 8 | 10 | < 0.1% |
| 10 | 9 | < 0.1% |
| Other values (7) | 16 | < 0.1% |
| (Missing) | 14981 | 5.3% |
| Value | Count | Frequency (%) |
| 0 | 213857 | |
| 1 | 41574 | 14.8% |
| 2 | 7900 | 2.8% |
| 3 | 1784 | 0.6% |
| 4 | 423 | 0.2% |
| 5 | 135 | < 0.1% |
| 6 | 69 | < 0.1% |
| 7 | 32 | < 0.1% |
| 8 | 10 | < 0.1% |
| 9 | 4 | < 0.1% |
| Value | Count | Frequency (%) |
| 99 | 2 | < 0.1% |
| 16 | 1 | < 0.1% |
| 15 | 1 | < 0.1% |
| 14 | 1 | < 0.1% |
| 12 | 2 | < 0.1% |
| 11 | 5 | < 0.1% |
| 10 | 9 | < 0.1% |
| 9 | 4 | < 0.1% |
| 8 | 10 | < 0.1% |
| 7 | 32 |
DT_NASC
Date
| Distinct | 3507 |
|---|---|
| Distinct (%) | 1.3% |
| Missing | 1645 |
| Missing (%) | 0.6% |
| Memory size | 2.1 MiB |
| Minimum | 2001-01-10 00:00:00 |
|---|---|
| Maximum | 2024-11-14 00:00:00 |
| Invalid dates | 0 |
| Invalid dates (%) | 0.0% |
BAIXO_PESO
Boolean
High correlation Imbalance
| Distinct | 2 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 274.3 KiB |
| False | |
|---|---|
| True | 24717 |
| Value | Count | Frequency (%) |
| False | 256073 | |
| True | 24717 | 8.8% |
Interactions
Correlations
| BAIXO_PESO | CONSPRENAT | CONSULTAS | ESCMAE2010 | GESTACAO | GRAVIDEZ | IDADEMAE | KOTELCHUCK | MESPRENAT | PARIDADE | PARTO | PESO | QTDFILMORT | QTDFILVIVO | QTDPARTCES | QTDPARTNOR | RACACOR | RACACORMAE | SEMAGESTAC | SEXO | STCESPARTO | |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| BAIXO_PESO | 1.000 | 0.084 | 0.126 | 0.014 | 0.522 | 0.277 | 0.034 | 0.124 | 0.028 | 0.030 | 0.017 | 0.674 | 0.004 | 0.004 | 0.000 | 0.000 | 0.017 | 0.017 | 0.538 | 0.020 | 0.028 |
| CONSPRENAT | 0.084 | 1.000 | 0.543 | 0.227 | 0.140 | 0.017 | 0.170 | 0.620 | -0.401 | 0.052 | 0.073 | 0.115 | 0.013 | -0.146 | 0.015 | -0.167 | -0.181 | -0.181 | 0.111 | 0.000 | 0.061 |
| CONSULTAS | 0.126 | 0.543 | 1.000 | 0.119 | 0.094 | 0.019 | 0.073 | 0.596 | 0.531 | 0.084 | 0.114 | 0.078 | 0.000 | 0.016 | 0.003 | 0.018 | 0.085 | 0.085 | 0.094 | 0.009 | 0.092 |
| ESCMAE2010 | 0.014 | 0.227 | 0.119 | 1.000 | -0.007 | 0.023 | 0.285 | 0.183 | -0.242 | 0.177 | 0.174 | 0.008 | -0.036 | -0.280 | 0.026 | -0.319 | -0.254 | -0.254 | -0.083 | 0.003 | 0.144 |
| GESTACAO | 0.522 | 0.140 | 0.094 | -0.007 | 1.000 | 0.137 | -0.028 | 0.069 | 0.009 | 0.012 | 0.047 | 0.333 | -0.027 | 0.006 | 0.003 | 0.001 | 0.018 | 0.018 | 0.615 | 0.008 | 0.031 |
| GRAVIDEZ | 0.277 | 0.017 | 0.019 | 0.023 | 0.137 | 1.000 | 0.126 | 0.015 | 0.020 | 0.011 | 0.390 | 0.152 | 0.001 | 0.006 | 0.000 | 0.005 | 0.014 | 0.014 | 0.139 | 0.015 | 0.046 |
| IDADEMAE | 0.034 | 0.170 | 0.073 | 0.285 | -0.028 | 0.126 | 1.000 | 0.125 | -0.162 | 0.352 | 0.204 | 0.043 | 0.188 | 0.380 | 0.263 | 0.215 | -0.156 | -0.156 | -0.108 | 0.000 | 0.121 |
| KOTELCHUCK | 0.124 | 0.620 | 0.596 | 0.183 | 0.069 | 0.015 | 0.125 | 1.000 | -0.492 | 0.079 | 0.114 | 0.056 | 0.010 | -0.112 | 0.012 | -0.132 | -0.104 | -0.111 | 0.040 | 0.014 | 0.097 |
| MESPRENAT | 0.028 | -0.401 | 0.531 | -0.242 | 0.009 | 0.020 | -0.162 | -0.492 | 1.000 | 0.014 | 0.042 | -0.009 | -0.017 | 0.134 | -0.028 | 0.162 | 0.157 | 0.157 | 0.060 | 0.003 | 0.061 |
| PARIDADE | 0.030 | 0.052 | 0.084 | 0.177 | 0.012 | 0.011 | 0.352 | 0.079 | 0.014 | 1.000 | 0.024 | 0.080 | 0.007 | 0.022 | 0.009 | 0.024 | 0.064 | 0.064 | 0.033 | 0.000 | 0.039 |
| PARTO | 0.017 | 0.073 | 0.114 | 0.174 | 0.047 | 0.390 | 0.204 | 0.114 | 0.042 | 0.024 | 1.000 | 0.052 | 0.000 | 0.013 | 0.000 | 0.015 | 0.112 | 0.112 | 0.079 | 0.014 | 0.703 |
| PESO | 0.674 | 0.115 | 0.078 | 0.008 | 0.333 | 0.152 | 0.043 | 0.056 | -0.009 | 0.080 | 0.052 | 1.000 | -0.002 | 0.091 | 0.067 | 0.047 | 0.017 | 0.016 | 0.386 | 0.097 | 0.044 |
| QTDFILMORT | 0.004 | 0.013 | 0.000 | -0.036 | -0.027 | 0.001 | 0.188 | 0.010 | -0.017 | 0.007 | 0.000 | -0.002 | 1.000 | 0.179 | 0.112 | 0.170 | 0.019 | 0.019 | -0.039 | 0.000 | 0.000 |
| QTDFILVIVO | 0.004 | -0.146 | 0.016 | -0.280 | 0.006 | 0.006 | 0.380 | -0.112 | 0.134 | 0.022 | 0.013 | 0.091 | 0.179 | 1.000 | 0.473 | 0.717 | 0.116 | 0.116 | -0.005 | 0.004 | 0.010 |
| QTDPARTCES | 0.000 | 0.015 | 0.003 | 0.026 | 0.003 | 0.000 | 0.263 | 0.012 | -0.028 | 0.009 | 0.000 | 0.067 | 0.112 | 0.473 | 1.000 | -0.204 | -0.046 | -0.046 | -0.061 | 0.000 | 0.000 |
| QTDPARTNOR | 0.000 | -0.167 | 0.018 | -0.319 | 0.001 | 0.005 | 0.215 | -0.132 | 0.162 | 0.024 | 0.015 | 0.047 | 0.170 | 0.717 | -0.204 | 1.000 | 0.165 | 0.165 | 0.036 | 0.003 | 0.012 |
| RACACOR | 0.017 | -0.181 | 0.085 | -0.254 | 0.018 | 0.014 | -0.156 | -0.104 | 0.157 | 0.064 | 0.112 | 0.017 | 0.019 | 0.116 | -0.046 | 0.165 | 1.000 | 1.000 | 0.075 | 0.000 | 0.121 |
| RACACORMAE | 0.017 | -0.181 | 0.085 | -0.254 | 0.018 | 0.014 | -0.156 | -0.111 | 0.157 | 0.064 | 0.112 | 0.016 | 0.019 | 0.116 | -0.046 | 0.165 | 1.000 | 1.000 | 0.073 | 0.000 | 0.121 |
| SEMAGESTAC | 0.538 | 0.111 | 0.094 | -0.083 | 0.615 | 0.139 | -0.108 | 0.040 | 0.060 | 0.033 | 0.079 | 0.386 | -0.039 | -0.005 | -0.061 | 0.036 | 0.075 | 0.073 | 1.000 | 0.016 | 0.079 |
| SEXO | 0.020 | 0.000 | 0.009 | 0.003 | 0.008 | 0.015 | 0.000 | 0.014 | 0.003 | 0.000 | 0.014 | 0.097 | 0.000 | 0.004 | 0.000 | 0.003 | 0.000 | 0.000 | 0.016 | 1.000 | 0.000 |
| STCESPARTO | 0.028 | 0.061 | 0.092 | 0.144 | 0.031 | 0.046 | 0.121 | 0.097 | 0.061 | 0.039 | 0.703 | 0.044 | 0.000 | 0.010 | 0.000 | 0.012 | 0.121 | 0.121 | 0.079 | 0.000 | 1.000 |
Missing values
Sample
| PESO | IDADEMAE | ESCMAE2010 | CONSULTAS | CONSPRENAT | MESPRENAT | KOTELCHUCK | GRAVIDEZ | QTDPARTNOR | QTDPARTCES | PARTO | STCESPARTO | SEMAGESTAC | GESTACAO | SEXO | RACACORMAE | PARIDADE | RACACOR | QTDFILVIVO | QTDFILMORT | DT_NASC | BAIXO_PESO | |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 0 | 3000.0 | 18.0 | 4.0 | 3.0 | 6.0 | 3.0 | 4 | 1.0 | 0.0 | 0.0 | 1.0 | NaN | 38.0 | 5.0 | 0.0 | 4.0 | 0 | 4.0 | 0.0 | 0.0 | 2014-01-15 | False |
| 1 | 3994.0 | 35.0 | 3.0 | 3.0 | 5.0 | 5.0 | 2 | 1.0 | 0.0 | 2.0 | 2.0 | 2.0 | 38.0 | 5.0 | 0.0 | 4.0 | 1 | 4.0 | 2.0 | 1.0 | 2014-02-19 | False |
| 2 | 2820.0 | 32.0 | 1.0 | 3.0 | 6.0 | 3.0 | 4 | 1.0 | 1.0 | NaN | 1.0 | NaN | 38.0 | 5.0 | 1.0 | NaN | 1 | NaN | 1.0 | NaN | 2014-05-19 | False |
| 3 | 3000.0 | 17.0 | 2.0 | 4.0 | 7.0 | 3.0 | 5 | 1.0 | 1.0 | NaN | 1.0 | NaN | 43.0 | 6.0 | 0.0 | 4.0 | 1 | 4.0 | 1.0 | NaN | 2014-02-20 | False |
| 4 | 2690.0 | 21.0 | 4.0 | 3.0 | 6.0 | 2.0 | 4 | 1.0 | NaN | NaN | 1.0 | NaN | 39.0 | 5.0 | 0.0 | 4.0 | 0 | 4.0 | NaN | NaN | 2014-06-10 | False |
| 5 | 2210.0 | 18.0 | 2.0 | 3.0 | 6.0 | 5.0 | 2 | 1.0 | 0.0 | 0.0 | 2.0 | 2.0 | 41.0 | 5.0 | 0.0 | 4.0 | 0 | 4.0 | 0.0 | 0.0 | 2014-06-30 | True |
| 6 | 2710.0 | 29.0 | 3.0 | 1.0 | NaN | 3.0 | 9 | 1.0 | 0.0 | 0.0 | 2.0 | 9.0 | 39.0 | 5.0 | 1.0 | 4.0 | 0 | 4.0 | 0.0 | 0.0 | 2014-04-28 | False |
| 7 | 2880.0 | 18.0 | 3.0 | 3.0 | 4.0 | 3.0 | 3 | 1.0 | 0.0 | 0.0 | 1.0 | NaN | 36.0 | 4.0 | 1.0 | 4.0 | 0 | 4.0 | 0.0 | 0.0 | 2014-02-26 | False |
| 8 | 3455.0 | 19.0 | 3.0 | 3.0 | 5.0 | 99.0 | 9 | 1.0 | 0.0 | 1.0 | 2.0 | 2.0 | 37.0 | 5.0 | 0.0 | 1.0 | 1 | 1.0 | 1.0 | 0.0 | 2015-03-23 | False |
| 9 | 3280.0 | 20.0 | 2.0 | 4.0 | 9.0 | 5.0 | 2 | 1.0 | 0.0 | 0.0 | 2.0 | 1.0 | 40.0 | 5.0 | 1.0 | 4.0 | 0 | 4.0 | 0.0 | 0.0 | 2014-07-22 | False |
| PESO | IDADEMAE | ESCMAE2010 | CONSULTAS | CONSPRENAT | MESPRENAT | KOTELCHUCK | GRAVIDEZ | QTDPARTNOR | QTDPARTCES | PARTO | STCESPARTO | SEMAGESTAC | GESTACAO | SEXO | RACACORMAE | PARIDADE | RACACOR | QTDFILVIVO | QTDFILMORT | DT_NASC | BAIXO_PESO | |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 280780 | 2885.0 | 31.0 | 5.0 | 4.0 | 7.0 | 2.0 | 5 | 1.0 | 0.0 | 0.0 | 2.0 | 1.0 | 37.0 | 5.0 | NaN | 1.0 | 0 | 1.0 | 0.0 | 0.0 | 2024-03-30 | False |
| 280781 | 2310.0 | 30.0 | 5.0 | 4.0 | 8.0 | 1.0 | 5 | 2.0 | 0.0 | 2.0 | 2.0 | 1.0 | 34.0 | 4.0 | NaN | 4.0 | 1 | 4.0 | 2.0 | 0.0 | 2023-11-23 | True |
| 280782 | 3205.0 | 24.0 | 3.0 | 4.0 | 7.0 | 2.0 | 5 | 1.0 | 1.0 | 0.0 | 1.0 | 3.0 | 39.0 | 5.0 | NaN | 4.0 | 1 | 4.0 | 1.0 | 0.0 | 2023-12-06 | False |
| 280783 | 2614.0 | 33.0 | 3.0 | 3.0 | 4.0 | 7.0 | 2 | 1.0 | 0.0 | 2.0 | 2.0 | 1.0 | 39.0 | 5.0 | NaN | 4.0 | 1 | 4.0 | 2.0 | 1.0 | 2023-09-15 | False |
| 280784 | 3525.0 | 15.0 | 3.0 | 4.0 | 8.0 | 3.0 | 5 | 1.0 | 0.0 | 0.0 | 1.0 | 3.0 | 41.0 | 5.0 | NaN | 4.0 | 0 | 4.0 | 0.0 | 0.0 | 2023-08-25 | False |
| 280785 | 3230.0 | 37.0 | 5.0 | 4.0 | 11.0 | 2.0 | 5 | 1.0 | 0.0 | 1.0 | 2.0 | 2.0 | 39.0 | 5.0 | NaN | 4.0 | 1 | 4.0 | 1.0 | 0.0 | 2023-10-11 | False |
| 280786 | 3155.0 | 25.0 | 2.0 | 4.0 | 8.0 | 2.0 | 5 | 1.0 | 0.0 | 1.0 | 1.0 | 3.0 | 35.0 | 4.0 | NaN | 4.0 | 1 | 4.0 | 1.0 | 0.0 | 2024-01-22 | False |
| 280787 | 3040.0 | 24.0 | 5.0 | 4.0 | 9.0 | 2.0 | 5 | 1.0 | 0.0 | 1.0 | 2.0 | 2.0 | 38.0 | 5.0 | NaN | 4.0 | 1 | 4.0 | 1.0 | 0.0 | 2024-02-05 | False |
| 280788 | 3410.0 | 37.0 | 5.0 | 4.0 | 12.0 | 2.0 | 5 | 1.0 | 0.0 | 0.0 | 2.0 | 2.0 | 39.0 | 5.0 | NaN | 1.0 | 1 | 1.0 | 0.0 | 1.0 | 2023-02-21 | False |
| 280789 | 3795.0 | 26.0 | 3.0 | 4.0 | 10.0 | 1.0 | 5 | 1.0 | 1.0 | 0.0 | 1.0 | 3.0 | 39.0 | 5.0 | NaN | 1.0 | 1 | 1.0 | 1.0 | 0.0 | 2024-03-12 | False |